141 research outputs found
WordSup: Exploiting Word Annotations for Character based Text Detection
Imagery texts are usually organized as a hierarchy of several visual
elements, i.e. characters, words, text lines and text blocks. Among these
elements, character is the most basic one for various languages such as
Western, Chinese, Japanese, mathematical expression and etc. It is natural and
convenient to construct a common text detection engine based on character
detectors. However, training character detectors requires a vast of location
annotated characters, which are expensive to obtain. Actually, the existing
real text datasets are mostly annotated in word or line level. To remedy this
dilemma, we propose a weakly supervised framework that can utilize word
annotations, either in tight quadrangles or the more loose bounding boxes, for
character detector training. When applied in scene text detection, we are thus
able to train a robust character detector by exploiting word annotations in the
rich large-scale real scene text datasets, e.g. ICDAR15 and COCO-text. The
character detector acts as a key role in the pipeline of our text detection
engine. It achieves the state-of-the-art performance on several challenging
scene text detection benchmarks. We also demonstrate the flexibility of our
pipeline by various scenarios, including deformed text detection and math
expression recognition.Comment: 2017 International Conference on Computer Visio
Learning Independent Instance Maps for Crowd Localization
Accurately locating each head's position in the crowd scenes is a crucial
task in the field of crowd analysis. However, traditional density-based methods
only predict coarse prediction, and segmentation/detection-based methods cannot
handle extremely dense scenes and large-range scale-variations crowds. To this
end, we propose an end-to-end and straightforward framework for crowd
localization, named Independent Instance Map segmentation (IIM). Different from
density maps and boxes regression, each instance in IIM is non-overlapped. By
segmenting crowds into independent connected components, the positions and the
crowd counts (the centers and the number of components, respectively) are
obtained. Furthermore, to improve the segmentation quality for different
density regions, we present a differentiable Binarization Module (BM) to output
structured instance maps. BM brings two advantages into localization models: 1)
adaptively learn a threshold map for different images to detect each instance
more accurately; 2) directly train the model using loss on binary predictions
and labels. Extensive experiments verify the proposed method is effective and
outperforms the-state-of-the-art methods on the five popular crowd datasets.
Significantly, IIM improves F1-measure by 10.4\% on the NWPU-Crowd Localization
task. The source code and pre-trained models will be released at
\url{https://github.com/taohan10200/IIM}
Some variational recipes for quantum field theories
Rapid developments of quantum information technology show promising
opportunities for simulating quantum field theory in near-term quantum devices.
In this work, we formulate the theory of (time-dependent) variational quantum
simulation of the 1+1 dimensional quantum field theory
including encoding, state preparation, and time evolution, with several
numerical simulation results. These algorithms could be understood as near-term
variational analogs of the Jordan-Lee-Preskill algorithm, the basic algorithm
for simulating quantum field theory using universal quantum devices. Besides,
we highlight the advantages of encoding with harmonic oscillator basis based on
the LSZ reduction formula and several computational efficiency such as when
implementing a bosonic version of the unitary coupled cluster ansatz to prepare
initial states. We also discuss how to circumvent the "spectral crowding"
problem in the quantum field theory simulation and appraise our algorithm by
both state and subspace fidelities.Comment: 28 pages, many figures. v2: modified style, add references, clear
typos. v3: significant change, authors adde
- …